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CLAIMS: 



1. A method fdr obtaining a candidate nucleotide sequence, the candidate 
nucleotide sequence \being indicative of a sequence of a target polynucleotide 

5 molecule T, T producing a hybridization signal I(x) upon incubating T with a 
polynucleotide x for 4ach polynucleotide f in a set E of polynucleotides, the 
method comprising the ateps of: 

(a) for each polynucleotide x in the set E of polynucleotides, obtaining a 
probability P 0 (x) of the V hybridization signal I(x) when the sequence x is not 

10 complementary to a subsequence of T and a probability Pi(x) of the hybridization 
signal when the sequence Y? is complementary to a subsequence of T; so as to 
obtain a probabilistic spectrum (PS) of T; 

(b) assigning a scdre to each of a plurality of candidate nucleotide 
sequences, the score being baled upon the probabilistic spectrum and upon at least 
one reference nucleotide sequgr 

(c) selectin^6ne or biore c^hdidate nucleotide sequences having an 
essentially maximal score. 

2. The method according to (^laimX^wherein the polynucleotides jc in the set 
E are immobilized on a surface. 

to Clafom 1 m& wherein the set E is a set of k-mers. 
to ClaJrff 3 wherein E is the set of all k-mers formed 
from nucleotides from a prg3eterminfed set of nucleotides.. 

5. The method of Claim 4 whenein the predetermined set of nucleotides is 
selected from the group consisting of ' 

(a) adenine, guanine, cytosirle, and thymine; and 



15 



20 3, 

4. 



The method according 
The method according 
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(b) adenine, guanine, cytosii 



V 6, The method according to aajUi 



, uracil. 



s- o £ 4he previca isjlLaiinSy wherein the score 
of a candidate nucleotide sequeice f is based upon Z c (f) where 
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wherein T(x)=0 if 
f and f (x)= 1 if th 



the sequence of x is not complementary to a subsequence of 



sequence of x is complementary to a subsequence of f . 
7. The method according to^ ny one of Clahm 1 l u 6 , wherein the score of a 

candidate sequence fi* based upon £ c (f) where log Z*(t) wherein f 

Me,) 



i=0 



contains polynucleotides L, . . ,e m and m{e i ) = log 



8. The method according to ^j * nj!f P iirnTL pieviuus claims* , wherein T and H 
have a common length. 

9. The method according to Claim 8, wherein the score of a candidate 

sequence f is based upon Dj (T) where D u (f)= [[^[^ ,h ,] , wherein M G) [t j? hj] 

is a probability of a nucleotide tj in position j of T being replaced with nucleotide hj 
in position j of H. 

10. The method^gordind to C^im 9, wherein the score of a candidate 
nucleotide sequenVe f ys Sconrj (a), or Score u 2 (f) where 
Score^{f)=logL e (f)+Y^ ani^re u 2 (f)=lo^^ 

11. The method according tojCiaarriO wherein the polynucleotides in the set E 
are k-mers and the step^5fselebting a candidate nucleotide sequence having an 
essentially maximal score comprises the steps of 

\ k ~ l r 

(a) For each (k- 1 )-mer j/fcalculating ^ w [y , £ - 1] = ^ L {i) [y h j 

(b) for each integer j = k, 1. .1, 
(ba) for each polynucleotide sequence (yi, . . . y k _]) 

(baa) calculati 



(/) 



wherein L 0) [y, hj]= log 
(bab) selecting 



U-i>^]+ max {S u [z,j-l] + a>(e)\ 

a(k-l)-merP[j; j] 

{ 



« 
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max 



satisfying\ 

S"W,j],j -l] +4fl?, A?)) = max [S"fey-l] +a( e )} 

(c) selectii^ a (k-l)-mer Z 1 having a score essentially equal to 

S u [yj]; 

(d) for j=k-i„.,l-l; recursively calculating (k-l)-mers Z j where 



(e) selecting candidate target sequence <z 



k-l 



k-l k-l k 

Z 2 ? -..Z k-l, Z k-l, 



z k+1 k _i ? . . .z ! k-i> 9 where Z J =kz\ z*2. . .z*k-i> 
12. The method accordW to Claim 9, wherein the polynucleotides in the set E 
are k-mers, and the step df selecting a candidate nucleotide sequence having an 
essentially maximal score comprises the steps of: 

(a) If the length / ^f the target is greater than the predetermined constant, 

l+k-l 
settting m - — ~ 

(b) For e#6h j\= k - 
for all y 

(c) For each j 



rn, computing S u [yj] according to Claim 10 
m f computing R u [yj] according to 



equations (14) and (15) fo: all y\ 

(d) Selectingl^^ ; 

(e) Computing the optimkl sequence aligned to (A,... h m ) ending with y m9 

and the optimal sequence aligned to\(h m „h t ) beginning with y m . 

cAwvrv^ 1 

13. The method according te ^nykmo of Claim3 1 to 7, wherein H and T have 
lengths such that the length of T is less than the length of H. 

14. The method according to Claim 13, wherein the step of assigning a score to 
each of a plurality of candidate nucle&tide sequences and the step of selecting the 
candidate target sequence are performed according to Algorithm B. 
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15. The methdd according to ^my uue uf Claims 1 LO 1\ wherein H and T have 
arbitrary lengths. 

16. The method According to Claim 15, wherein the step of assigning a score to 
each of a plurality oAcandidate nucleotide sequences and the step of selecting the 
candidate target sequence are performed according to Algorithm C. 

17. The method according to Claim 15, wherein the step of assigning a score to 
each of a plurality of candidate nucleotide sequences and the step of selecting the 
candidate target sequence We performed according to Algorithm D. 

18. The method according to Claim 17 wherein a Hidden Markov Model is used 
l o instead of a reference sequence. 

19. The method according* to ^any one - of Claims 1 to 1 1", wherein the algebraic 
equation (12a') replaces the amgbFaic-^quation (12a), the algebraic equation (12b ! ) 
replaces the algebraic^etmation (12b) ? the algebraic equation (15 f ) replaces the 
algebraic equation (fl5), \and tl\e algebraic equation (16 1 ) replaces the algebraic 

15 equation (16). 
h 20. 

algebraic equation (20') replaces ^he algebraic egdation (20), and the algebraic 

equation (21') replaces the algebraic\equatiop^2l). 

jolt 

21. The method according v to^ t^^ne of Claims 15, 17, or 18, wherein the 
20 algebraic equation (29 1 ) replaces the algebraic equation (29), and the algebraic 

equation (30') replaces the algebraic equation (30). 
pc 22. The method according toj pny one of the previous-claims w herein the target 

comprises two or more polynucleotide molecules. 
I* 23. The method according to^ gay oim of Claims 1 to 22 computing the exact 
25 score L c (j) for several candidate sequerlpes chosen according to the value of the 

approximated score L c '{f). 
$c 24. The method according to ^ony one otthc picvious claims further comprising 
a step of deleting candidate sequences having likelihood below a predetermined 
value. 



\ c \ou r*. \ 

The method according te tany one of Cle ftr nfl 1 3 ;H, o r 1% wherein the 
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25. The method According to ^an)^ one uf Claims 1 lu 5, 0 lu 24, wherein the 
score of a candidate nucleotide sequence f is based upon Z°(f) where 



x€.A 



5 



wherein f (x) = r if thd sequence of x is complementary to exactly r subsequences 
off. 

26. The method according to ^rny one o f the pievlom Claiiri§ T wherein the set E 

of polynucleotide does notWlude all the polynucleotide of a specific length. 

\ clAt/^* * 

* 27. The method according to ^ony ono of tin pnviuus ildinis * , wherein the set E 
of polynucleotide includes polynucleotides of different lengths. 

io^ 28. The method accordinAto/ \a^mL uf the pievious chri tfifrfor use in a task 
selected from the group comprising: 

(a) Detecting or genoWping of Single Nucleotide Polymorphisms. 

(b) Detecting or genotVping of genetic syndroms or disorders. 

(c) Detecting or genot^ing^somatic mutations. 

(d) Sequencip^^ polynucleotide whose function is related to the 
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function of the reference pownucle^ 
29. The method according to t 



polynucleotides contain gaps 
30. The method accord 



)ti« 



iQ of the previous claims , wherein 

or universal ' 

one of the previous claims wherein 
20 polypeptides are sequenced instead of polynucleotides. 

31. A program storage device readable by machine, tangibly embodying a 
program of instructions executable bytthe machine to perform method steps for 
obtaining a candidate nucleotide sequencte, the candidate nucleotide sequence being 
indicative of a sequence of a target polynucleotide molecule T, T producing a 
25 hybridization signal I(x) upon incubating T with a polynucleotide x for each 
polynucleotide x in a set E of polynucleotides, the method comprising the steps of: 
(a) for each polynucleotide x in tfte set E of polynucleotides, obtaining a 
probability Po(x) of I(x) when the sequence x is not complementary to a 
subsequence of T and a probability Pi(ic) of I(x) when the sequence x is 
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complementary to a subsdquence of T; so as to obtain a probabilistic spectrum (PS) 
ofT; 

(b) assigning a Wore to each of a plurality of candidate nucleotide 
sequences, the score being &ised upon the probabilistic spectrum and upon at least 

5 one reference nucleotide sequence H; and 

(c) selecting a candidate nucleotide sequence having an essentially 
maximal score. 

32. A computer program pVoduct comprising a computer useable medium 
having computer readable program code embodied therein for obtaining a 
10 candidate nucleotide sequence, the candidate nucleotide sequence being indicative 
of a sequence of a target polynucleotide molecule T, T producing a hybridization 
signal I(x) upon incubatingJfW^ x for each polynucleotide x 

in a set E of polynucleotides, the computer program product comprising: 

(a) for eacftspoVnucleoti^e Ain the set E of polynucleotides, computer 
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readable program code foi 
I( x ) the sequence x is not 



causing the coriwuter to obtain a probability Po(3c) of 
complemdnta^to a subsequence of T and a probability 
Pi( x ) of I( x ) when the seqiienge^Pls^omplementary to a subsequence of T; 

(b) computer readable program code for causing the computer to assign a 
score to each of a plurality of candickte nucleotide sequences, the score being 

20 based upon the probabilistic spectrum and upon at least one reference nucleotide 
sequence H; and 

(c) computer readable program dode for causing the computer to select a 
candidate nucleotide sequence having an essentially maximal score. 




